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Abstract 

Background: One major concern with adaptive designs, such as the sample size adjustable designs, has been the fear 
of inflating the type I error rate. In (Stat Med 23:1 023-1 038, 2004) it is however proven that when observations follow a 
normal distribution and the interim result show promise, meaning that the conditional power exceeds 50%, type I 
error rate is protected. This bound and the distributional assumptions may seem to impose undesirable restrictions on 
the use of these designs. In (Stat Med 30:3267-3284, 201 1) the possibility of going below 50% is explored and a region 
that permits an increased sample size without inflation is defined in terms of the conditional power at the interim. 

Methods: A criterion which is implicit in (Stat Med 30:3267-3284, 201 1) is derived by elementary methods and 
expressed in terms of the test statistic at the interim to simplify practical use. Mathematical and computational details 
concerning this criterion are exhibited. 

Results: Under very general conditions the type I error rate is preserved under sample size adjustable schemes that 
permit a raise. The main result states that for normally distributed observations raising the sample size when the result 
looks promising, where the definition of promising depends on the amount of knowledge gathered so far, guarantees 
the protection of the type I error rate. Also, in the many situations where the test statistic approximately follows a 
normal law, the deviation from the main result remains negligible. This article provides details regarding the Weibull 
and binomial distributions and indicates how one may approach these distributions within the current setting. 

Conclusions: There is thus reason to consider such designs more often, since they offer a means of adjusting an 
important design feature at little or no cost in terms of error rate. 
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Background 

The last few years interest in various adaptive trial designs 
has surged [1]. A greater flexibility of clinical study design 
and conduct has followed from the application of these 
new ideas [2]. In [1] adaptive designs are defined as: 

"...a clinical trial design that allows adaptations or 
modifications to aspects of the trial after its initiation 
without undermining the validity or integrity of the 
trial. " 
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More and more trials of this sort are being reported and 
regulatory bodies take an increasingly favourable view on 
them [3] . All stand to win if these designs come to optimal 
use [4]. However, some concerns have been raised. One 
of these involve the risk of inflating type I error. The cur- 
rent article will assess that risk in the context of sample 
size adjustable (SSA) designs that allow choosing between 
raising the sample size, continuing as originally planned, 
or closing the trial due to futility. 

The following will recapitulate parts of [5], add detail 
and draw conclusions for trial procedures. In that ref- 
erence the authors show that if the interim results look 
promising, no inflation of type I error rate occurs. Here 
promising' means that the conditional power at the cur- 
rent parameter estimate, i.e. the power updated by the 
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accumulated knowledge at interim, amounts to at least 
50%. This article will show that a less strict bound applies, 
in agreement with [6], exhibit the bound in terms of a test 
statistic, and present mathematical as well as computa- 
tional aspects of it. 

Methods 

Assumptions 

Denote the planned final sample size by No, the number 
of patients available at the pre-planned interim analysis by 
n, and the possible raise determined at the interim tak- 
ing conditional power into account by r. Let us consider a 
one-sided test at level a based on observing X\, . . . , Xx final . 
Here Nf ma i = N 0 or Nf ma i = N = N 0 + r, depending 
on a decision taken during the course of the trial The 
main result assumes normal distribution, but as will be 
outlined, it will still hold true for more general distribu- 
tions. Further, assume the X[ to be independent normal 
with mean 0 and variance 1. The null hypothesis states 
that 0 = 0. Define the normalised test statistic by 
Z (x) = y* =1 Xi/y/x. The test rejects if Z (N f^ l) > z a , 
where z a is the 100 x (1 — a) percentile of the standard nor- 
mal distribution: <3>(z a ) = 1 — ot (O being the cumulative 
distribution function of the standard normal distribu- 
tion). The normalised test statistic = X^i^'/V^ 
is observed when n patients have provided data, and the 
Data Monitoring Committee (DMC) will in part base its 
recommendations on the observed value. At this interim 
analysis an adaptation may lead to closing the study due 
to futility, continuing the study without changes or raising 
the sample size by recruiting an extra r subjects, yield- 
ing a total of N = No + r subjects. Closing the study 
due to futility may only decrease the type I error rate. 
So let us, for the sake of argument, disregard that pos- 
sibility, and show that the type I error rate still remains 
protected. 

The study protocol will specify n and No, and at the 
interim we will consider raising the final sample size based 
on the conditional power evaluated at the current param- 
eter estimate. Since the objective is to assess if the interim 
results are promising the current estimate of the param- 
eter of interest gives the appropriate information [6]. As 
pointed out by Muller and Schafer in [7], the over- all type 
I error can be preserved unconditionally under any gen- 
eral adaptive change, provided the conditional type I error 
that would have been obtained had there been no adapta- 
tion is preserved. This article however only considers the 
case of SSA. Unlike the situation in [8] the design does not 
permit sequential testing. Also, the article only consid- 
ers the conventional hypothesis tests and p-values without 
adjustments. 

We assess the conditional error rate as a function of r. By 
showing that the conditional type I error rate is bounded 
by the error rate which arises from the design without 



adaptation the unconditional error rate is proven to be 
controlled at a pre-specified level a. 

Derivation of the main result 

We use the notation X ~ N(fi,cr 2 ) to signify that X 
follows a normal law with mean fi and variance a 2 . 

The change in type I error rate conditional on a sam- 
ple size increase decided at the interim equals G(r) = 

p r(Z (M)+r) > Za 0n) = = Q) _ p r ( Z (No) > 

z a \Z^ = z,0 = 0). The conditional distribution equals 

0N)0n) = Z)Q = o) ~ N(pz,l - p 2 ), where 

P = Cov(J2li^/VN,J^ =1 Xj/^i) = EJLi vV^v = 
yfnjN, and similarly for Z^ N °\ 

Expressing the difference in terms of normal distribu- 
tions yields 



G(r) = O 



- O 



N 0 +r J 



N 0 Z ' 



\ 



Now, in order to show this difference to be less than 
or equal to zero it may be equivalently shown that the 
difference of the arguments is negative (in the sense of 
non-positive), and denote this by H(r). Obviously 7/(0) = 
0. 

To simplify notation put q = n/(No + r) and V = (No + 
r) /No for arbitrary n, No and r, satisfying No > n > 0, and 
r > 0. Please note qV = n/No- Then we aim to show 



< 0 



This implies 
( V% 



fqV 



V 1 - q V 1 q v 



< Z a 



And, then, multiply by —1 and divide by the multiplier 
of z, to obtain 



z > z 0 



y/T^q - y/i-qv , VWV^q - y^vi - qy 



y/1 — q<s/l — qV 



-I 



yi — q-s/l — qV 
which after cancelling out *JY — q^/1 — qV becomes 



z > Zaiy/T^q ~ yfl^qV)/(yfqVyfl^~q - yfqJl^V) 
:= z a b(q, V) 

(1) 



Now let us compare this bound to ZayfnjNo. Denote the 
bound in (1) by z a b(q, V), and set out to prove b(q, V) < 
y/qV. By subtracting y/qV from both sides and equating 
denominators we have 



VH- x/T^V- yfqV( yfqVyfl^- JqyfT^qV) 
y/qVy/l-q-^/qy/l-qV 



<0 
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But the denominator is positive under current assump- 
tions. Thus, we may disregard it, and thus, we need to 
prove 

y/l ~ q(l - qV) + y/l - qV(qW - 1) < 0 

Division by y/l — qV, moving the right term to the right 
hand side of the inequality symbol, and squaring both 
sides yields 

(1 - q)(l -qV)<l- 2qVV + q 2 V 

By expanding the left hand side product, eliminating 
terms and multiplying both sides by —l/q t we finally have 

V + 1 > 2y/V, 

which is true for all positive V. 

Now regard b as a function of r for n and A/n fix. One 
may show that, b(r) / y/n/No asymptotically as r \ 0. 
Further, b decreases as r grows in a close to linear fashion. 
Also, b (r) \ ( 1 — y/l — qV) / y/qV when r tends to infinity. 

Please note that since the b(q, V) < y/qV = y/n/No a 
sufficient but not necessary condition is z > z a y/n/No, 
which will be seen to give the conditional power 50% (the 
simple criterion). Consequently, this new criterion is less 
restrictive than the one presented in [5], and, importantly, 
changes with r. The reference [6] provides an example 
where the type I error remains intact although the condi- 
tional power descends down to 36%. 

To obtain the conditional power please note that 
£[ Z (M))| Z (") = Z) 0 = z /y/n]= zlJnJW^ and further- 
more 

(Z (M>)| Z («) = z> e = zljn) ~ N(zl4WJWv, \ - 
n/No). Then 

Pr[ Z m) > z a \Z (n) =z,e= z/Jn\ 

_ _ I z a -z/Jn/Np I 

{ J 

The minimum of this probability over z > b(q, V)z a 
equals 

^ VRF ) 

From the definition of G(r) it follows that one cannot go 
further without increasing the conditional error rate. In 
this sense the bound is optimal. 

Weibull ditributed survival time points 

We will now study the situation where survival times fol- 
low a Weibull distribution and right censoring time points 
are exponentially distributed. 

In [9] the details of an Edgeworth expansion of the prod- 
uct limit estimator are given — n~^(p(x)(K3(x 2 — 



1) + 3<7i)/6. First some notation: X = lifetime, T = left 
truncation time point, Y = right censoring time point, 
Z = min(X, Y), 8 = I(X < Y). Further, put C(z) := 
P(T < z < Z\T < Z). But since T = 0 this probability 
equals P(Z > z) = P(X > z,Y > z) = P(X > z)P(Y > z). 
Then Wi(y) := P(Z < y, 8 = 1) = P(X < y, Y > X). 
With a x (z) = f* ^§rffi, the constant k 3 in the Edgeworth 

expansion equals erf 1 (-7 Serf + J* C~ 3 (t)dWi(t)). As 
stated we assume X ~ Weib(X, /3) and Y ~ Exp(fi). 
From this follows that C(y) = exp{—\xy — ky&), W\(y) = 
fy e~ llx kfixP~ l e~ Xxfi dx. Thus one may at the interim use 
parameter estimates to calculate a normal approximation 
to the conditional power. Alternatively, one may simulate 
the remainder of the trial. A third option is to base the 
procedure on the logrank test whose statistic converges 
to a normal distribution. Consider the situation where the 
time to some event is compared between patients in an 
active treatment group and those in a control group. Let r; 
refer to the number of patients remaining at time i and 0{ 
refer to the number of observed events. Further, let A refer 
to the active treatment group and C to the control group. 

Tf T — X^ k r iA0iC-rjC0iA anr l y _ V- /c ^j^jlr^C 

then z = T/^/V will asymptotically be standard normal, 
e.g. [10]. Hence one may apply the simple criterion to z 
observed at the interim. 

Binomial proportion 

For the sake of simplicity of exposition we focus atten- 
tion to a single binomial proportion p and a one-sided 
test at the 5% level. Let the null hypothesis and alternative 
hypothesis be Ho : p = po,H\ : p > po. Please note that 
for Xn 0 ~ Bin(No,po) the conditional distribution given 
{X n = k} is the same as X^ Q - n + k ~ Bin(No — n,po), and 
similarly for Xx 0 + r . 

From this follows that we may obtain G(r) exactly: in 
terms of R code [11] 

#let the level of the test be 5% 

qs . r <-qbinom(p = 0.95, size = NO+r, 

prob = pO) 

qs <- qbinom(p = 0.95, size = NO, prob = pO) 
G.r <- pbinom (qs-k, size = N0-n,prob = pO) - 
pbinom(qs.r - k, size = NO+r-n, prob = pO) 

However, we will look at a normal approximation. In the 
binomial case several test statistics with close to normal 
distribution exist: 

1. the score test statistic: 

z = y/n(p -po)/y/po(X -po)),p = k/n 

2. the log-odds: 

z = y/np(X -p) (log(p/(l -p)) - log(p 0 /(l -p 0 ))) 
[12] 
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The simple criterion would then say that if z as above 

exceeds J~j^z ai then the procedures protects the type I 

error rate (unconditionally). But we set out to find a more 
accurate approximation. 
Now using the condition {X n = k} 

G(r) =P po (XM 0 + r - n > q a ,No+r - k) 
- P po (X No - n > q a)No - k), 

where q a>m is the 100 x (1 — a) percentile o{Bin(m,po). 

Also the binomial distribution X n ~ Bin(n,p) admits 
a normal approximation of the pivotal statistic U = 
(X n — E[X n ] )/SD(X n ), which coincides with the score test 
statistic above, such that 



P([/>^) = 1-<DU- 



J—( K3 tf-l))\+0[ - 
6^/n ) \n 



in terms of the third cumulant of U, which picks up the 
skewness. As a rule of thumb it is often said that the nor- 
mal approximation is quite accurate when np and nil — p) 
both exceed 5. But this statement holds even without the 
correction with respect to skewness. 

In this case we may approximate the difference G(r) 
defined above by 

G(r) =P po (XM 0 + r > qa,No+r\Xn = k) 
— Pp 0 (XN 0 > qa,N 0 \X n = k), 

where we insert the percentiles obtained from the inver- 
sion of the Cornish-Fisher expansion, cf. e.g. [13]: 

Ko 



qa,n = + Vncro(Za + ^=(z a ~ 1)), 

o^/n 



denoting \i n = npo, gq = ^po(l — po), and the third 
cumulant (1 — 2po)/cro by yo. This quantity will deviate 
less than 1 from the true percentile for n from 20 to 200, 
and min{npo, nil —po)} > 5. Let us consider G(r) through 
the pivotal quantities 



U N = 



qa,N 



HN-n 



We will be concerned with the difference 



qa,No+r — k — /XjVo+r-w 
GNo+r—n 
qa,No — k — I^Nq—vi 
ONo-n 



The task is now to identify when this difference is pos- 
itive. To simplify notation denote by n\ the larger of the 
two sample sizes and by n<i the smaller. After equating the 
denominators and noting fi n . — \i ni - n = npo, i = 1,2 the 
difference equals: 



- n (npp + JnjZgGo + Yocrpjzl l)/6 - k) 
- n)(n 2 - n)a 0 
_ V«i - n { n Po + s/n2Z a cro + Xo^o (z% - 1)/ 6 - k) 

Disregarding the positive denominator yields the 
condition: 

(V^2 - n - yjn\ - n)(np 0 + y ° ao ^ — - k) 

6 

+(V^2 - njn[ - <Jn\ - n^n)z a cro > 0 
Some algebra will unearth the condition 

Xo^o (z 2 a - 1) 



k >np 0 + 



6 



+ 



(V^ 2 - njn{ - - n^/jn)z a cT 0 



V^2 - n - +Jn\ - n 

Please note that the first term corresponds to expecta- 
tion of the null distribution. Further, the second term will 
be negative if po > 0.5, and the third will always be nega- 
tive under the conditions of this paper. From this follows 
that the normal approximation of G(r) is non-positive 
for k satisfying the above condition. Finally, invoke the 
fast convergence of the binomial distribution towards a 
normal law, which means that already 20 observations 
will make the normal approximation quite accurate, pro- 
vided min{np,n(l — p)} > 5. Simulations indicate that 
this decision rule is accurate already at an interim sample 
size n as low as 20. However, the rule does not guar- 
antee preservation of the conditional type I error rate 
for all p. Thus the conclusion is that for the binomial 
distribution there is no inflation of the unconditional 
type I error rate under the above conditions. A total of 
900000 simulations with n from 20 to 100, po picked 
randomly in [5/n, 1 — 5/n], k randomly generated from 
Bin(n,po) and No = 2n and r = n gave a median and 
mean of G(r) equal to -0.004762 and -0.004574, respec- 
tively, over the set defined by the inequality above. A set 
of similar simulations using the simple criterion (k > 
n(po + y/poQ- — Po))/NoZ a ) gave median and mean equal 
to -0.02429 and -0.02389, respectively. Thus the simple 
criterion will be on the conservative side. 

Results 

Main result 

In Methods the following result was derived. 

A conditional power that quarantees preservation of nominal 
significance level 

If the conditional power at the interim, which occurs after 
n out No planned observations and leads to a raise of r, 
equals at least 

t (Kq, v)/^w-iy 



(2) 
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where q = n/(No + r), V = (No + r)/No, and 



(3) 

then the type I error rate is preserved. The function b sat- 
isfies the inequalities (1 — y/1 — qV)/^/qV < b(q, V) < 
JqV = JWJWo. 

A more practical criterion, or rule of thumb, may be to 
derive a test statistic z with close to a standard normal 
distribution under the null hypothesis, and check whether 
z > y^-Za- This will be referred to as the simple crite- 
rion, and stems from [5]. More generally, the condition 
z > b(q, V)z a suffices (cf. equations (11) and (12) in 
[6]). The conditional power bound in (2) decreases as r 
increases, but the lower bound on b implies a limit. 

Example 

Take the example of n = 55, No = 110, r = 40 and 
a = 0.025, z a = 1.96. Then the minimum conditional 
power equals 43%, see next subsection. Thus a conditional 
power of considerably less than 50% is permissible from 
the point of view of type I error rate preservation. This 
may be good to know if the original sample size calcu- 
lation was grossly wrong. Then recruiting more subjects 
than planned may resolve the issue without jeopardising 
the type I error rate. On the other hand, in such a situation 
the validity of the scientific hypotheses on which the trial 
design rests may be questioned, and the sponsor will have 
to judge whether the updated hypotheses suggest a com- 
mercially viable route. Nevertheless, in some cases raising 
the sample size will make sense, and may save the trial 
from unnecessary disaster. 

Above we assume the variance to be known. If it is not 
we may estimate it and use for instance a £-test statistic 
which quickly converges to a normal as the sample size 
increases. 

Examination of the £-test has provided evidence of a 
small degree of inflation [14]. In [15] further details of 
when inflation occurs are given. However, already at a 
sample size of 30 the ^-distribution and the normal distri- 
bution appear almost identical. 

Calculations in R 

In the statistical software environment R [11] one may 
easily define functions. Let us regard the bound z a b(q, V) 
as a function of (n,No,r) instead. We may explore the 
bound z a b(n, No, r) through the R function B.func 

B . f unc <- function (n, NO, r) {q <- n/ (NO+r) ; 
V <- (NO+r) /NO; (sqrt (1-q) -sqrt (l-q*V) ) / 
(sqrt (q*V) *sqrt (1-q) -sqrt (q) *sqrt (l-q*V) ) } 

and, the minimum conditional power through 



CP.min <- function (alphal , n, NO, r) { 

za <- qnorm (1 -alphal) 

q <-n/(N0+r);V <- (NO+r) /NO 

b <- B.func (n = n, NO = NO , r = r) 

pnorm(za* (b/sqrt (q*V) -1) /sqrt (l-q*V) ) } 

So, for instance CRminfalphal = 0.025, n = 55, N0=110, 
r = 40) = 0.43, and, B.func(n = 55, NO = 110, r = 
0.01) = 0.7070907, which approximately equals ^n/N 0 = 
V55/110 « 0.7071068. Also CP.min(alphal = 0.025, n = 
55, NO = 110, r = 110) = 0.3575873. 

Deviations from normal distribution 

If we use non-normal data such as survival type of data, 
then it is often possible to approximate the test statis- 
tic by a normal variate. Many test statistics, e.g. those 
derived by the maximum likelihood method, converge 
quickly to a normal distribution when the sample size 
increases. This feature extends the relevance of the main 
result to measurements following other distributions than 
the normal. 

In Methods we looked into the situation where a 
Kaplan-Meier (KM) estimate is used. The Edgeworth 
expansion of the distribution of the (standardised) KM 
estimator has the form Q>(x)—n~^(j) (x)ks (x 2 — 1) /6, where 
K3 is specified in Methods [9,16], O the cumulative dis- 
tribution function of a standard normal variate and <p its 
frequency function. So if we express the change in condi- 
tional error rate (G(r) below) in terms of this expansion 
the correction term to difference between normal distri- 
bution functions will approach zero as l/^/n. Assuming 
some parametric distribution, such as the Weibull dis- 
tribution, one may work out the details regarding this 
approximation. Or, one may assess the deviation from 
normality through a simulation procedure. 

In the case of a single binomial proportion p and a one- 
sided test of the null hypothesis Ho : p = po versus the 
alternative hypothesis p > po, it holds that if we at the 
interim observe X n = k satisfying 



k >np 0 + 



+ 



(VN 0 - njn{ - yjn\ - n^N^)z a cr 0 



with cr 0 = p 0 (l -po), yo = (1 - 2p 0 )/cr 0 and n\ = N 0 + r, 
then inflation of the type I error rate will not occur. More 
precisely put: on average , over all possible outcomes, the 
procedure will preserve the type I error rate. However, 
the conditional error rate will not always fall below the 
nominal one. 
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Discussion 

There are operational issues with adaptive designs that 
must be addressed during the planning stage. In order 
to safeguard the integrity of the trial and avoid opera- 
tional bias following an unblinded interim precautions 
need to be put in place to limit access to both the results 
and, even, the analysis plans. The latter will specify the 
output and decision rules, but will leave open the pos- 
sibility of including other information, such as external 
factors in the final decision whether to stop for futility 
or to continue, and if so, whether or not to raise the 
sample size. 

Further, a number of concerns have been raised involv- 
ing the risk of violating statistical principles or lack 
of efficacy compared to group sequential designs, e.g. 
[17-19]. 

However valid these objections may be, more and more 
practitioners have felt that the challenges are tractable and 
have found SSA designs an attractive option. For small 
biotechnology companies this option gives the possibility 
of starting a trial with rather limited resources, followed 
by an additional investment conditional on the interim 
results being promising. Also, the SSA design makes a 
lot of sense whenever a fix size design would have to 
rely on quite limited amount of information regarding the 
primary variable. 

Several references have argued the superiority of seam- 
less phase II/III designs over the traditional phase II and 
III trials. Merging the two phases produces gains in valu- 
able time [20], and, under reasonable conditions, saves 
sample size [21]. 

Earlier research has established that a conditional power 
at the interim analysis exceeding 50% implies that the con- 
ditional, and hence also the unconditional, type I error 
rate is preserved, cf. [5,7]. Further, the reference [6] builds 
on [8] and others to identify a more general region where 
this happens. The region is identified through equations 
(11) and (12) in [6]. The derivation of the region relies 
on results for Brownian motion. Together these two 
equations implicitly define a bound that coincides with b 
in (3) above. 

Further, one cannot use a lower bound without 
risking inflation of the conditional error rate, and 
thus one may not rely on the Muller-S chafer prin- 
ciple of conditional error functions [7] (new does 
not exceed the original) to prove preservation of 
unconditional error rate 1 . By virtue of the Muller- 
Schafer principle of conditional error functions any 
interim decision rule, pre-defined or not, that does 
not violate this fundamental requirement will permit a 
redesign of the trial. So from this perspective the SSA 
designs described here are well behaved and offer great 
flexibility. 



Conclusions 

This article has shown that the risk of compromising the 
nominal significance level of a statistical test by allow- 
ing a sample size increase during the course of a trial 
remains low and controllable. The conditional error rate 
and power provide key decision tools. 

Endnotes 

1 Also, by reversing the order of terms in G(r) and 
tracing the same line of thought one may conclude that a 
sample size decrease is permissible when results are 
discouraging. But then it may make more sense to 
discontinue the trial due to futility. 
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